Clustering Sequences in a Metric Space The MoBIoS Project
نویسندگان
چکیده
We are developing a [Molecular] Biological Information System (MoBIoS) based on metric space indices. Unfortunately, common similarity measures for sequence alignment do not form a metric-distance function. This is particularly vexing since the usual definition of edit distance does form a metric. Most clearly, the use of PAM log-odds matrices [2] yields higher similarity scores for more closely related sequences, an intuitively appealing result that reverses metric order. Further, log-odds scoring matrices contain negative values that can yield negative global alignment scores. This violates positivity. Use of PAM matrices also can violate symmetry and the triangle inequality.
منابع مشابه
MoBIoS: A Metric-Space DBMS to Support Biological Discovery
MoBIoS is a specialized database management system whose storage manager is based on metric-space indexing, and whose query language entails biological data types. When relational database management systems are used to support biological data, important data types are relegated to blob and unstructured text fields. Consequently, even simple, but critical queries are executed by sequentially du...
متن کاملUsing MoBIoS’ Scalable Genome Joins to Find Conserved Primer Pair Candidates Between Two Genomes
For the purpose of identifying evolutionary reticulation events in flowering plants, we determine a large number of paired, conserved DNA oligomers that may be used as primers to amplify orthologous DNA regions using the polymerase-chain reaction (PCR). We develop an initial candidate set by comparing the Arabidopsis and rice genomes using MoBIoS (Molecular Biological Information System). MoBIo...
متن کاملFixed Point Results on $b$-Metric Space via Picard Sequences and $b$-Simulation Functions
In a recent paper, Khojasteh emph{et al.} [F. Khojasteh, S. Shukla, S. Radenovi'c, A new approach to the study of fixed point theorems via simulation functions, Filomat, 29 (2015), 1189-–1194] presented a new class of simulation functions, say $mathcal{Z}$-contractions, with unifying power over known contractive conditions in the literature. Following this line of research, we extend and ...
متن کاملUsing MoBIoS' scalable genome join to find conserved primer pair candidates between two genomes
MOTIVATION For the purpose of identifying evolutionary reticulation events in flowering plants, we determine a large number of paired, conserved DNA oligomers that may be used as primers to amplify orthologous DNA regions using the polymerase chain reaction (PCR). RESULTS We develop an initial candidate set by comparing the Arabidopsis and rice genomes using MoBIoS (Molecular Biological Infor...
متن کاملA note on convergence in fuzzy metric spaces
The sequential $p$-convergence in a fuzzy metric space, in the sense of George and Veeramani, was introduced by D. Mihet as a weaker concept than convergence. Here we introduce a stronger concept called $s$-convergence, and we characterize those fuzzy metric spaces in which convergent sequences are $s$-convergent. In such a case $M$ is called an $s$-fuzzy metric. If $(N_M,ast)$ is a fuzzy metri...
متن کامل